Skip to content

fixing loop eval#10

Open
maxtheman wants to merge 7 commits intoopenprose:mainfrom
maxtheman:codex/act-loop-eval
Open

fixing loop eval#10
maxtheman wants to merge 7 commits intoopenprose:mainfrom
maxtheman:codex/act-loop-eval

Conversation

@maxtheman
Copy link

Problem
The actions for evals have been failing sine 2/13

Screenshot 2026-03-16 at 4 17 20 PM

In order to get some reliable evals for this project, I wanted to fix it.

Solution

I believe the core issue was feeding too much data into the claude Read tool at once. ARC can produce 2-3MB, which seems to exceed the limit for claude, and their harness propagates that to the main loop.

Proof that it now runs is here: https://github.com/maxtheman/node-rlm/actions/runs/23169689550

I didn't want to start catching issues randomly, so instead added logic to break the offending json file into several, and author a manifest.

misc

Happy to squash these commits or make any requested style or substance changes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant